Inference for Clustered Data
نویسندگان
چکیده
This article introduces clusteff, a new Stata command for checking the severity of cluster heterogeneity in cluster robust analyses. Cluster heterogeneity can cause a size distortion leading to underrejection of the null hypothesis. Carter, Schnepel, and Steigerwald (2015) develop the effective number of clusters to reflect a reduction in the degrees of freedom, thereby mirroring the distortion caused by assuming homogenous clusters. clusteff generates the effective number of clusters. We provide a decision tree for cluster robust analysis, demonstrate the use of clusteff, and recommend methods to minimize the size distortion.
منابع مشابه
Flexible Bayesian quantile regression for independent and clustered data.
Quantile regression has emerged as a useful supplement to ordinary mean regression. Traditional frequentist quantile regression makes very minimal assumptions on the form of the error distribution and thus is able to accommodate nonnormal errors, which are common in many applications. However, inference for these models is challenging, particularly for clustered or censored data. A Bayesian app...
متن کاملSemiparametric Marginal and Association Regression Methods for Clustered Binary Data.
Clustered data arise commonly in practice and it is often of interest to estimate the mean response parameters as well as the association parameters. However, most research has been directed to inference about the mean response parameters with the association parameters relegated to a nuisance role. There is little work concerning both the marginal and association structures, especially in the ...
متن کاملStatistical Analysis of Clustered Data using SAS System
ABSTRACT Clustered data is very common, such as the data from paired eyes of the same patient, from multiple teeth of the same mouth, from animals of the same litter, from siblings in the same family. The key feature of clustered data is that outcomes from the same cluster are likely to be positively correlated. The proper analysis of clustered data requires taking this correlation into conside...
متن کاملEstimated estimating equations: Semiparametric inference for clustered/longitudinal data
We introduce a flexible marginal modelling approach for statistical inference for clustered/longitudinal data under minimal assumptions. This estimated estimating equations (EEE) approach is semiparametric and the proposed models are fitted by quasi-likelihood regression, where the unknown marginal means are a function of the fixed-effects linear predictor with unknown smooth link, and variance...
متن کاملA hidden Markov model for haplotype inference for present-absent data of clustered genes using identified haplotypes and haplotype patterns
The majority of killer cell immunoglobin-like receptor (KIR) genes are detected as either present or absent using locus-specific genotyping technology. Ambiguity arises from the presence of a specific KIR gene since the exact copy number (one or two) of that gene is unknown. Therefore, haplotype inference for these genes is becoming more challenging due to such large portion of missing informat...
متن کاملInference Processes on Clustered Partial Decision Rules
The aim of the paper is to study efficeient of inference process using clustered partial decision rules. Partial decision rules are constructed by greedy algorithm. They are clustered with Agglomerative Hierarchical Clustering (AHC) algorithm. We study how exact and partial decision rules clustered by AHC algorithm influence on inference process in knowledge base. Clusters of rules are a way of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017